﻿ Tehnicide IngineriaLimbajului Natural Curs 8 Niveluldiscursului: Teoria Nervurilor (coerență) și rezumare automată Curs: Dan Cristea Laboratoare: Diana Trandabăț, Mihaela Onofrei, Daniela Gîfu, Ionuț PistolThe account of VT on coherence •Veins give a natural way to generalize Centering from local to global Centering Rule 2: transitions Cb(u) = Cb(u-1) Cb(u) ¹Cb(u-1) Cb(u) = Cp(u)CONTINUINGSMOOTH SHIFT Cb(u) ¹Cp(u)RETAININGABRUPT SHIFT CON > RET > SSH > ASH Vein expressions give „lines of argumentation“ 1 Johnsoldhisbicycle 2 althoughBillwouldhavewantedit 3 Heobtainedagoodpriceforit, whichBillcouldnothaveafforded 1 Johnsoldhisbicycle45 Thereforehedecidedtousethemoneytogoonatrip 1 Johnsoldhisbicycle 5V=1 3 5 3 Heobtainedagoodpriceforit, 5 Thereforehedecidedtousethemoneytogoonatrip 3 5123V=1 4 V=1 3 4 5 V=1 2 3 5 V=1 3 5 Lines of argumentation 2 althoughBillwouldhavewantedit 1 Johnsoldhisbicycle 5V=1 3 5 2 althoughBillwouldhavewantedit 3 Heobtainedagoodpriceforit, 5 Thereforehedecidedtousethemoneytogoonatrip 3 5123V=1 4 V=1 3 4 5 V=1 2 3 5 V=1 3 5 Lines of argumentation 3 Heobtainedagoodpriceforit, 1 Johnsoldhisbicycle 5V=1 3 5 3 Heobtainedagoodpriceforit, 5 Thereforehedecidedtousethemoneytogoonatrip 3 5123V=1 4 V=1 3 4 5 V=1 2 3 5 V=1 3 5 Lines of argumentation 4 whichBillcouldnothaveafforded 1 Johnsoldhisbicycle 5V=1 3 5 3 Heobtainedagoodpriceforit, 4 whichBillcouldnothaveafforded 5 Thereforehedecidedtousethemoneytogoonatrip 3 5123V=1 4 V=1 3 4 5 V=1 2 3 5 V=1 3 5 Lines of argumentation 5 Thereforehedecidedtousethemoneytogo onatrip 5V=1 3 5 1 Johnsoldhisbicycle 3 Heobtainedagoodpriceforit, 5 Thereforehedecidedtousethemoneytogoonatrip 3 5123V=1 4 V=1 3 4 5 V=1 2 3 5 V=1 3 5 Computation of longest argumentationlines (al) uV(u)dea(u)al 11351 212351212 313513 41345134134 5135135135 Evaluating the coherence of a discourse •A smoothness score: –CONTINUING = 4 –RETAINING = 3 –SMOOTH SHIFT =2 –ABRUPT SHIFT = 1 –NO Cb = 0 •A global smoothness score:summing up the score of all units The second conjecture (on coherence) •The global smoothness score of a discourse when computed following VT is at least as high as the score computed following CT •But segments, as considered by Centering, typically are developed along veins •When passing segments frontiers, in a linear reading, transitions are usually abrupt •Therefore, what we claim here is that long- distance transitions, as computed along veins, are systematically smoother than accidental transitions at segment boundaries Transitions and scores on a linear adjacency metric J = [John], b = [John's bicycle], B = [Bill], p = [price], m = [the money], t = [a trip]) 12345 CfJ, bB, bJ, p, bp, BJ, m, t CbJbbp- TransASHRETSSHNo Cb Score1320 Global6/4 = 1 5 Transitions and scores on a hierarchical adjacency metric 12134135 CfJ, bB, bJ, bJ, p, bp, BJ, bJ, p, bJ, m, t CbJbJJpJJJ TransASHCONSSHCON Score1424 Global11/4=2 75 Verifying the second conjecture SourceNo of CT scoreAverage VT scoreAverage transitionsCT score VT score per per transitiontransition English59761 25841 38 French471092 351162 47 Romanian651422 181522 34 Total1733271 893522 03 Incremental discourse parsing using VT 15 The account of VT on cohesion and coherence –According to VT, a discourse tree correctly characterizes a text if two constraints are satisfied with respect to the corresponding set of veins: •a cohesioncriterion –a score associated to the resolution of anaphors on antecedents placed on veins is maximized; •a coherencecriterion –along veins (i e not linearly) a global smoothness score of the discourse, which sums up Centering transitions scores, is maximized Put VT at the base of a process of parsing the discourse •VT revealed a hidden structure associated to a text and formulated two constraints •Now, for parsing a text, reverse the process: use criteria evidenced by VT to guide the parsing process –HOW?: among all possible structures a text can have, the correct ones simultaneously observe both cohesion and coherence criteria In incremental discourse parsing two principles are of help The principle of sequenciality •A left to right reading of the terminal frontier of the tree associated with a discourse must correspond to the span of text it analyses in the same left-to-right order The right frontier constraint •A discourse constituent must be attached on the (generalised) right frontier of the ongoing discourse Generalised right frontier and inner-most substitution node inner-most substitution node: the place for the next substitution generalised right frontier: nodes on this frontier are candidates for adjunction Apply TAG inspired operations: substitution k Although Bill would have wanted it, k+1 John sold his bicycle to somebody else tt’ k+1 k+1 kòk Apply TAG inspired operations: adjoining On the right frontier (Polanyi, 1988) ta t’ a sa* a1 sa s0s0s1 s1 Expectations-driven incremental parsing (Cristea and Webber, 1997) EVIDENCE EVIDENCE a Clintonisboundtowintheelections a ANT-CONS b Heisanaturalborncampaigner b ANT-CONS c Ifyouholdsomepositiononanissue,c e d thenifClintonwantstogetyourvote,d e he will assure you with great sincerity that he holds that position too Expectations-driven incremental parsing: variant 1 a Clintonisboundtowintheelections b Heisanaturalborncampaigner EVIDENCE b * a Expectations-driven incremental parsing: variant 1 a Clintonisboundtowintheelections b Heisanaturalborncampaigner c Ifyouholdsomepositiononanissue, EVIDENCE EVIDENCE *ANT-CONS a b c ? Expectations-driven incremental parsing: variant 1 a Clintonisboundtowintheelections b Heisanaturalborncampaigner c Ifyouholdsomepositiononanissue, d hewillassureyouwithgreatsinceritythatheholdsthatpositiontoo EVIDENCE a EVIDENCE bANT-CONS c ? Expectations-driven incremental parsing: variant 1 a Clintonisboundtowintheelections b Heisanaturalborncampaigner c Ifyouholdsomepositiononanissue, d hewillassureyouwithgreatsinceritythatheholdsthatpositiontoo EVIDENCE a EVIDENCE bANT-CONS cd Expectations-driven incremental parsing: variant 2 a Clintonisboundtowintheelections b Heisanaturalborncampaigner c Ifyouholdsomepositiononanissue, d then if Clinton wants to get your vote, EVIDENCE a EVIDENCE bANT-CONS ANT-CONS c d? ? Expectations-driven incremental parsing: variant 2 a Clintonisboundtowintheelections b Heisanaturalborncampaigner c Ifyouholdsomepositiononanissue, d then if Clinton wants to get your vote, e hewillassureyouwithgreatsinceritythatheholdsthatpositiontoo EVIDENCE a EVIDENCE bANT-CONS ANT-CONSc d? Expectations-driven incremental parsing: variant 2 a Clintonisboundtowintheelections b Heisanaturalborncampaigner c Ifyouholdsomepositiononanissue, d then if Clinton wants to get your vote, e hewillassureyouwithgreatsinceritythatheholdsthatpositiontoo EVIDENCE a EVIDENCE bANT-CONS ANT-CONSc de Discourse markers tell a lot about structure (Marcu, 1997, 2000; Cristea et al , 2003, 2005) [Because John is such a generous man1] [–whenever he is asked for money,2] [he will give whatever he has, for example3] [–he deserves the “Citizen of the year” award 4] Reproduced from (Cristea and Webber,1998) because , because-, - 1234 123 12 1234 1342 132 1423 Discourse markers tell a lot about structure [Because John is such a generous man1] [–wheneverhe is asked for money,2] [he will give whatever he has, for example3] [–he deserves the “Citizen of the year” award 4] whenever , whenever-, - 234 23 23 243 Discourse markers tell a lot about structure [BecauseJohn is such a generous man1] [–whenever he is asked for money,2] [he will give whatever he has,for example3] [–he deserves the “Citizen of the year” award 4] , for example -, -for example 123 132 12323 Discourse markers tell a lot about structure [BecauseJohn is such a generous man1] [–wheneverhe is asked for money,2] [he will give whatever he has, for example3] [–he deserves the “Citizen of the year” award 4] There are only two trees that can be obtained after considering all constraints: because -, -because -, - -, -for example41whenever -, - 1whenever -, , -for example 4 2323 because , 1234because , 1234 whenever , 23whenever , 234 , for example123 , for example23 The incremental generation of the first interpretation [Because John is such a generous man1] [–whenever he is asked for money,2] ? because -, - whenever -, - 1* 2 The incremental generation of the first interpretation [Because John is such a generous man1] [–whenever he is asked for money,2] [he will give whatever he has, for example3] because -, - ? 3 1whenever -, - 2 The incremental generation of the first interpretation [Because John is such a generous man1] [–whenever he is asked for money,2] [he will give whatever he has, for example3] [–he deserves the “Citizen of the year” award 4] because -, - ? 4 1whenever -, - 32 The incremental generation of the first interpretation [Because John is such a generous man1] [–whenever he is asked for money,2] [he will give whatever he has, for example3] [–he deserves the “Citizen of the year” award 4] because -, - ? 4 1whenever -, - 32 The incremental generation of the second interpretation [Because John is such a generous man1] [–whenever he is asked for money,2] because -, -whenever -, - 12 The incremental generation of the second interpretation [Because John is such a generous man1] [–whenever he is asked for money,2] [he will give whatever he has, for example3] because -, - -, -for example 1whenever -, - 3 2* The incremental generation of the second interpretation [Because John is such a generous man1] [–whenever he is asked for money,2] [he will give whatever he has, for example3] [–he deserves the “Citizen of the year” award 4] because -, - 4 1whenever -, - -, -for example 23 The incremental generation of the second interpretation [Because John is such a generous man1] [–whenever he is asked for money,2] [he will give whatever he has, for example3] [–he deserves the “Citizen of the year” award 4] because -, - 1whenever -, - -, -for example 4 23 How can references help in discovering the structure? [Because John is such a generous man1] [–whenever heis asked for money,2] [hewill give whatever he has, for example3] [–he deserves the “Citizen of the year” award 4] because -, - because -, - right whenever -, - -, -for example41 wrong , -for example 1whenever -, -4-DEA=1 4 3DEA=2 4 232 DEA=1 2 3 DEA=1 2 DEA=2 3 DEA=1 2 Anaphoric links can indicate the right structure a BecauseMarywasupset, b evenifJohnagreed, c theydidn’tspeaktooneanotherforseveraldays ca abV=(a)cbc DEA=ac V=(a)(b)c DEA=abcright wrong Incremental parallel processing (Cristea et al , 2005) NP-chunkerAR-engine disc parsersummarizer segmentatoredts-builder VT guides an incremental discourse parsing (Cristea, 2000; Cristea et al , 2003, 2005) The tree resulted after parsing is the one which manifests: –the more natural overall references over the discourse structure –the smoothest overall CT transitions on veins The discourse parser implements a „beam search“ N trees observing markers‘ well- formedness constraints reference score cohesion transitions score coherence N Exercise For the sentence: [John is determined to pass the NLP exam 1] so, because[he has missed many courses 2] and[was only vaguely implicated at the working sessions 3] , [he will have a hard time until summer 4] knowing the involved markers have the argument patterns: find the possible parsing trees Deciding the structure 1 so, because2 and3, 4 Have the following domains of arguments: so because and Then, out of all possible combinations, filter out: empty lists, non- adjacent lists and those placed at distance from their markers The following combinations of arguments remain: so because and so because and so because and and In (Cristea et al , 2003)there are expressed a number of rules that can constrain well-formedness of tree structures: “nesting-arguments”, “balanced-displacement”, “unique-root” and “one-parent” Only one observe all constraints VT’s account on Summarisation Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental Pipe-line versus incremental processing pipe-line incremental The architecture NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder Processing: incremental parallel combined with pipe-line NP-chunkerAR-engine parsersummarizerdisc segmentatoredts-builder NP chunker NP-chunkerAR-engine disc parsersummarizer segmentatoredts-builder NP chunker SEROQUEL … The pharmacokinetics of quetiapine were not significantly altered following co-administration with the antipsychotics risperidone or haloperidol However, co-administration of Seroquel and thioridazine caused increases in clearance of iapine …quet AR-engine NP-chunkerAR-engine disc parsersummarizer segmentatoredts-builder AR-engine: referential links SEROQUEL … The pharmacokinetics of quetiapine were not significantly altered following co-administration with the antipsychotics risperidone or haloperidol However, co-administration of Seroquel and thioridazine caused increases in clearance of iapine …quet AR-engine: the architecture text AR-engineAR-model1 AR-model2 AR-model3 anaphoric links Segmentator NP-chunkerAR-engine disc parsersummarizer segmentatoredts-builder Segmentator SEROQUEL … The pharmacokinetics of quetiapine were not significantly altered following co-administration with the antipsychotics risperidone or haloperidol However, co-administration of Seroquel and thioridazine caused increases in earance of quetiapine …cl Elementary Discourse trees (edts)-builder NP-chunkerAR-engine disc parsersummarizer segmentatoredts-builder edts-builder following co-administration with The pharmacokinetics of the antipsychotics risperidone or quetiapine were not significantly haloperidol altered edts-builder ELABORATION followingco-administration with The pharmacokinetics of the antipsychotics risperidone or quetiapine were not significantly haloperidol altered edts-builder However, co-administration of Seroquel and thioridazine caused increases in clearance of quetiapine edts-builder CONCESSION *Ho wever, co-administration of Seroquel and thioridazine caused increases in clearance of quetiapine Discourse parser NP-chunkerAR-engine disc parsersummarizer segmentatoredts-builder Operations Adjoining: always on the (generalised) right frontier of the developing tree (Cristea & Webber, 1997) pdtipdti -1 edti * foot node Operations Substitution: on the most inner substitution node of the developing tree (Cristea & Webber, 1997) pdtipdti -1 edti ¯ substitution node Evocative and post-evocative references most frequent direct reference indirect reference inferential reference less frequent Inferential references, by type of anaphor 4% the 4 (Gundel et al , 1993; Ide and Cristea, 2001) mber Remential inferences! refere type of anaphordisobeying VT pragmatic56 3% propernouns22 7%Evocative power commonnouns16 0% pronouns5 0% Reference score •To score references in relation with veinswe have given the values 2, 1and 0for the values direct, indirectand inferential, respectively •Then, to score the anaphor type we have given the values 3, 2and 1for the following categories of anaphors: pronoun, common nounand proper noun, respectively •Then we have multiplied these scores for each anaphor, allowing each anaphor to contribute to the general score of the tree with a value between 0 and 6: –0 = any of its antecedents are outside the DEAof anaphor’s unit –… –6 = pronoun whose most recent antecedent is included inthe DEAof the unit the anaphor belongs to Transitions score •A local smoothness score: –CONTINUING = 4 –RETAINING = 3 –SMOOTH SHIFT =2 –ABRUPT SHIFT = 1 –NO Cb = 0 •A global smoothness score:summing up the transition scores of all units The Discourse Parser combines the two scores (Cristea, 2000) Veins Theory guides an incremental discourse parsing process The tree resulted after parsing is the one which manifests: –the more natural overall references over the discourse structure (cohesion) –the smoothest overall Centeringtransitions on veins(coherence) Incremental parallel processing NP-chunkerAR-engine disc parsersummarizer segmentatoredts-builder Focused summaries can easily be obtained by VT Sincethe vein expressionof a discourse unit is meant to give the minimal sequence of units that are required to understand just that unit in the context of the whole discourseè the vein expression of a unit gives also a summary of the discourse focused on that unit The text George Orwell: “1984” Annotation: edus are numbered Winston was dreaming of his mother 1/ Winston o visa pe mama lui He must2/, he thought3/, have been ten or eleven years old2/ when his mother had disappeared 4/ Trebuie să fi avut, se gândi el, vreo zece-unsprezece ani când mama lui dispăruse She was a tall, statuesque, rather silent woman with slow movements and magnificent fair hair 5/ Era o femeie înaltă, ca o statuie, şi destul de tăcută; avea mişcări lente şi un păr blond minunat The summary gold # of hits # of units 12345678910118283 The summary gold # of hits # of units 12345678910118283 No ofedts No of edusNo of sentences of this No of generated edts lengthper sentence 1÷3251÷4 4÷595÷28 6142 VT guides the summarisation process Evaluation in different points Impact of different modules minus 4 5%minus 10 4% NP-AR-engineAR-engine chunker parsersummarizedisc r segmentatsegmentatedt- orordetector minus 52 9%Rezumare: un extras din “Legendele Olimpului” Zeul-dupăaceea-acoborîtînlumeşiacutreieratprinţărinenumărate,căutîndu-şilocdetemplu Şi,totumblîndpedrumuri, Apoloaajunsînniştelocuristranii,stîncoase,mohorîte PasămitesosiseînmunteleParnas Aicisîntniştevîrfuridepiatră cenuşie,ceparcăstausăcadăîntr-unabisdesmoalăclocotitordevuiet Şuvoaieledeaer-unaercaldsiumed-ţîşnesc necontenitşivorbelerostitese-ntorciute-necouri,sunînddinstîncă-nstîncă Pădurea-imisterioasă Frunzişuleişopteşte Unizvorîşiprăvaleapasacristalinăsprevalea-nneguratăsărindpestepripoare Şi-nmunte-iostrîmtoareafundă,mocirloasă,undetraiaatuncea-precumspunealegenda-unşarpenumitPiton Trupul acestuimonstrueralungcîtunmunte,cu-osutădegrumazurişiosutădeguri,careazvîrleauflăcări,duhoriînecăcioase Duhorileacesteaînţesautotvăzduhulşioameni,fiare,gîze,păsărisaualtefiinţe,dacătreceaupeacolo,sestingeauotrăvite Şarpelesenăscusedintr-omocirlăverde Mocirlarămăsesedinapelevărsateacolo,prinpotopulorînduitdeZeus Pitonprimise-ntainăporuncădelaHerasă-lpîndeascăpeApolo,cîndvatreceprinmunte,şisă-irăpunăviaţa Hera-lurapefiul celnounăscutalLetei,pentrucăsoţulsău,preaputerniculZeus,ţineamaimultladînsuldecitlafiiiei:HefaistosşiAres CîndaajunsApoloînmunteleParnas,dihaniauriaşăs-aavîntatspredînsul,dornicăsă-lucidă Darzeulşi-aîntinsarcul Atras primasăgeată Eraudoarpatruziledecîndvăzuselumea,şiîntiialuisăgeatăaşinimeritmonstrul EdreptcăşisăgeatafusesefăuritădefieraruldinLemnos,Hefaistos,schilodul,cumaremeşteşug Cumi-apătrunssăgeata încarneamucedă,şarpeleasimţitcăi-avenitsfîrşitul Aîncercatsăscapeşisăsefurişezeîntr-opesterăadîncă Darn-a maiizbutit Chinuitdedureri,Pitons-azvîrcolitprinverzilemocirleîncarevieţuia Asăritînnisipulcelgalbendepemalurişialunecat iară,cuşuierecumplite,înputredabaltoacă,învremeceveninulisescurgeadinboturi Monstruls-azvîrcolittotmaiîncetînapă Ochiiis-aulăsat S-aucufundatpeîncetul,detot,înîntuneric Aşas-aprăpădit Iar trupulsăulungitpezecişizecidestînjeni,is-auscatcutimpul,subrazeledesoare Şilocurileacesteaunde-arăpusApolopefiorosulmonstruaufostnumiteDelfi ÎnDelfisevorpunetemeliiunuitempluşi-unuivestitoracol,ahotărîtApolo Oameniivorputeasă-ntrebeşisăaflerăspunsla întrebări,prinpreuteasamea,Pitia,înţeleapta Easevaaşezapeuntrepieddeaur,învăluitînpielealuiPitoncelrăpus, deasuprauneigroteadîncisifumegînde,ceducepînă-nţaraluiHades,mohorîtul Prineavoidarăspunsuri Daroamenii datorisîntsă-midăruiascăaurşisculepreţioaseşizecidebogăţii,răsplatăpentrumineşipreoţiidintemplu Maiporuncesc, de-asemeni,săaibăloc,încinstealupteimelecumonstrul,jocuri,serbări,concursuri,şicîntece,şidanţuri Săfieveselie, iareusăfiuslăvit Dupăaceeazeul,pătatîncădesînge-desîngeleluiPiton-apornitsăsespele,să-şipurificetrupul,înmiculrîuTempe Rezumare: un extras din “Legendele Olimpului” Prima mențiune a lui Hefaistos Zeul-dupăaceea-acoborîtînlumeşiacutreieratprinţărinenumărate,căutîndu-şilocdetemplu Şi,totumblîndpedrumuri, Apoloaajunsînniştelocuristranii,stîncoase,mohorîte PasămitesosiseînmunteleParnas Aicisîntniştevîrfuridepiatră cenuşie,ceparcăstausăcadăîntr-unabisdesmoalăclocotitordevuiet Şuvoaieledeaer-unaercaldsiumed-ţîşnesc necontenitşivorbelerostitese-ntorciute-necouri,sunînddinstîncă-nstîncă Pădurea-imisterioasă Frunzişuleişopteşte Unizvorîşiprăvaleapasacristalinăsprevalea-nneguratăsărindpestepripoare Şi-nmunte-iostrîmtoareafundă,mocirloasă,undetraiaatuncea-precumspunealegenda-unşarpenumitPiton Trupul acestuimonstrueralungcîtunmunte,cu-osutădegrumazurişiosutădeguri,careazvîrleauflăcări,duhoriînecăcioase Duhorileacesteaînţesautotvăzduhulşioameni,fiare,gîze,păsărisaualtefiinţe,dacătreceaupeacolo,sestingeauotrăvite Şarpelesenăscusedintr-omocirlăverde Mocirlarămăsesedinapelevărsateacolo,prinpotopulorînduitdeZeus Pitonprimise-ntainăporuncădelaHerasă-lpîndeascăpeApolo,cîndvatreceprinmunte,şisă-irăpunăviaţa Hera-lurapefiul celnounăscutalLetei,pentrucăsoţulsău,preaputerniculZeus,ţineamaimultladînsuldecitlafiiiei:HefaistosşiAres CîndaajunsApoloînmunteleParnas,dihaniauriaşăs-aavîntatspredînsul,dornicăsă-lucidă Darzeulşi-aîntinsarcul Atras primasăgeată Eraudoarpatruziledecîndvăzuselumea,şiîntiialuisăgeatăaşinimeritmonstrul EdreptcăşisăgeatafusesefăuritădefieraruldinLemnos,Hefaistos,schilodul,cumaremeşteşug Cumi-apătrunssăgeata încarneamucedă,şarpeleasimţitcăi-avenitsfîrşitul Aîncercatsăscapeşisăsefurişezeîntr-opesterăadîncă Darn-a maiizbutit Chinuitdedureri,Pitons-azvîrcolitprinverzilemocirleîncarevieţuia Asăritînnisipulcelgalbendepemalurişialunecat iară,cuşuierecumplite,înputredabaltoacă,învremeceveninulisescurgeadinboturi Monstruls-azvîrcolittotmaiîncetînapă Ochiiis-aulăsat S-aucufundatpeîncetul,detot,înîntuneric Aşas-aprăpădit Iar trupulsăulungitpezecişizecidestînjeni,is-auscatcutimpul,subrazeledesoare Şilocurileacesteaunde-arăpusApolopefiorosulmonstruaufostnumiteDelfi ÎnDelfisevorpunetemeliiunuitempluşi-unuivestitoracol,ahotărîtApolo Oameniivorputeasă-ntrebeşisăaflerăspunsla întrebări,prinpreuteasamea,Pitia,înţeleapta Easevaaşezapeuntrepieddeaur,învăluitînpielealuiPitoncelrăpus, deasuprauneigroteadîncisifumegînde,ceducepînă-nţaraluiHades,mohorîtul Prineavoidarăspunsuri Daroamenii datorisîntsă-midăruiascăaurşisculepreţioaseşizecidebogăţii,răsplatăpentrumineşipreoţiidintemplu Maiporuncesc, de-asemeni,săaibăloc,încinstealupteimelecumonstrul,jocuri,serbări,concursuri,şicîntece,şidanţuri Săfieveselie, iareusăfiuslăvit Dupăaceeazeul,pătatîncădesînge-desîngeleluiPiton-apornitsăsespele,să-şipurificetrupul,înmiculrîuTempe Rezumare: un extras din “Legendele Olimpului” A doua mențiune a lui Hefaistos Zeul-dupăaceea-acoborîtînlumeşiacutreieratprinţărinenumărate,căutîndu-şilocdetemplu Şi,totumblîndpedrumuri, Apoloaajunsînniştelocuristranii,stîncoase,mohorîte PasămitesosiseînmunteleParnas Aicisîntniştevîrfuridepiatră cenuşie,ceparcăstausăcadăîntr-unabisdesmoalăclocotitordevuiet Şuvoaieledeaer-unaercaldsiumed-ţîşnesc necontenitşivorbelerostitese-ntorciute-necouri,sunînddinstîncă-nstîncă Pădurea-imisterioasă Frunzişuleişopteşte Unizvorîşiprăvaleapasacristalinăsprevalea-nneguratăsărindpestepripoare Şi-nmunte-iostrîmtoareafundă,mocirloasă,undetraiaatuncea-precumspunealegenda-unşarpenumitPiton Trupul acestuimonstrueralungcîtunmunte,cu-osutădegrumazurişiosutădeguri,careazvîrleauflăcări,duhoriînecăcioase Duhorileacesteaînţesautotvăzduhulşioameni,fiare,gîze,păsărisaualtefiinţe,dacătreceaupeacolo,sestingeauotrăvite Şarpelesenăscusedintr-omocirlăverde Mocirlarămăsesedinapelevărsateacolo,prinpotopulorînduitdeZeus Pitonprimise-ntainăporuncădelaHerasă-lpîndeascăpeApolo,cîndvatreceprinmunte,şisă-irăpunăviaţa Hera-lurapefiul celnounăscutalLetei,pentrucăsoţulsău,preaputerniculZeus,ţineamaimultladînsuldecitlafiiiei:HefaistosşiAres CîndaajunsApoloînmunteleParnas,dihaniauriaşăs-aavîntatspredînsul,dornicăsă-lucidă Darzeulşi-aîntinsarcul Atras primasăgeată Eraudoarpatruziledecîndvăzuselumea,şiîntiialuisăgeatăaşinimeritmonstrul EdreptcăşisăgeatafusesefăuritădefieraruldinLemnos,Hefaistos,schilodul,cumaremeşteşug Cumi-apătrunssăgeata încarneamucedă,şarpeleasimţitcăi-avenitsfîrşitul Aîncercatsăscapeşisăsefurişezeîntr-opesterăadîncă Darn-a maiizbutit Chinuitdedureri,Pitons-azvîrcolitprinverzilemocirleîncarevieţuia Asăritînnisipulcelgalbendepemalurişialunecat iară,cuşuierecumplite,înputredabaltoacă,învremeceveninulisescurgeadinboturi Monstruls-azvîrcolittotmaiîncetînapă Ochiiis-aulăsat S-aucufundatpeîncetul,detot,înîntuneric Aşas-aprăpădit Iar trupulsăulungitpezecişizecidestînjeni,is-auscatcutimpul,subrazeledesoare Şilocurileacesteaunde-arăpusApolopefiorosulmonstruaufostnumiteDelfi ÎnDelfisevorpunetemeliiunuitempluşi-unuivestitoracol,ahotărîtApolo Oameniivorputeasă-ntrebeşisăaflerăspunsla întrebări,prinpreuteasamea,Pitia,înţeleapta Easevaaşezapeuntrepieddeaur,învăluitînpielealuiPitoncelrăpus, deasuprauneigroteadîncisifumegînde,ceducepînă-nţaraluiHades,mohorîtul Prineavoidarăspunsuri Daroamenii datorisîntsă-midăruiascăaurşisculepreţioaseşizecidebogăţii,răsplatăpentrumineşipreoţiidintemplu Maiporuncesc, de-asemeni,săaibăloc,încinstealupteimelecumonstrul,jocuri,serbări,concursuri,şicîntece,şidanţuri Săfieveselie, iareusăfiuslăvit Dupăaceeazeul,pătatîncădesînge-desîngeleluiPiton-apornitsăsespele,să-şipurificetrupul,înmiculrîuTempe Un rezumat obişnuit Zeul-dupăaceea-acoborîtînlumeşiacutreieratprinţărinenumărate,căutîndu-şilocdetemplu Şi,tot umblîndpedrumuri,Apoloaajunsînniştelocuristranii,stîncoase,mohorîte Pasămitesosiseînmuntele Parnas Şi-nmunte-iostrîmtoareafundă,mocirloasă,undetraiaatuncea-precumspunealegenda-unşarpenumit Piton Pitonprimise-ntainăporuncădelaHerasă-lpîndeascăpeApolo,cîndvatreceprinmunte,şisă-irăpunăviaţa CîndaajunsApoloînmunteleParnas,dihaniauriaşăs-aavîntatspredînsul,dornicăsă-lucidă Darzeulşi-a întinsarcul Atrasprimasăgeată Eraudoarpatruziledecîndvăzuselumea,şiîntiialuisăgeatăaşi nimeritmonstrul Monstruls-azvîrcolittotmaiîncetînapă Aşas-aprăpădit Dupăaceeazeul,pătatîncădesînge-desîngeleluiPiton-apornitsăsespele,să-şipurificetrupul,înmicul rîuTempe Un rezumat focalizat pe prima apariţie a lui Hefaistos Zeul-dupăaceea-acoborîtînlumeşiacutreieratprinţărinenumărate Apoloaajunsînniştelocuristranii, stîncoase,mohorîte PasămitesosiseînmunteleParnas Şi-nmunte-iostrîmtoareafundă,mocirloasă,undetraiaatunceaunşarpenumitPiton Pitonprimise-ntainăporuncădelaHerasă-lpîndeascăpeApolo,cîndvatreceprinmunte,şisă-irăpunăviaţa Hera-lurapefiulcelnounăscutalLetei,pentrucăsoţulsău,preaputerniculZeus,ţineamaimultladînsul decitlafiiiei:HefaistosşiAres CîndaajunsApoloînmunteleParnas,dihaniauriaşăs-aavîntatspredînsul,dornicăsă-lucidă Darzeulşi-a întinsarcul Atrasprimasăgeată Eraudoarpatruziledecîndvăzuselumea,şiîntiialuisăgeatăaşi nimeritmonstrul Monstruls-azvîrcolittotmaiîncetînapă Aşas-aprăpădit Dupăaceeazeul,pătatîncădesînge-desîngeleluiPiton-apornitsăsespele,să-şipurificetrupul,înmicul rîuTempe Un rezumat focalizat pe a douaapariţie a lui Hefaistos Zeul-dupăaceea-acoborîtînlumeşiacutreieratprinţărinenumărate Apoloaajunsînniştelocuristranii, stîncoase,mohorîte PasămitesosiseînmunteleParnas Şi-nmunte-iostrîmtoareafundă,mocirloasă,undetraiaatunceaunşarpenumitPiton Pitonprimise-ntainăporuncădelaHerasă-lpîndeascăpeApolo,cîndvatreceprinmunte,şisă-irăpunăviaţa dihaniauriasăs-aavîntatspredînsul,dornicăsă-lucidă Darzeulşi-aîntinsarcul Atrasprimasăgeată şiîntîialuisăgeatăaşinimeritmonstrul EdreptcăşisăgeatafusesefăuritădefieraruldinLemnos,Hefaistos,schilodul,cumaremeşteşug Monstruls-azvîrcolittotmaiîncetînapă Aşas-aprăpădit Dupăaceeazeul,pătatîncădesînge-desîngeleluiPiton-apornitsăsespele,să-şipurificetrupul,înmicul rîuTempe References Cristea, D , Ide, N and Romary, L (1998): Veins Theory An Approach to Global Cohesion and Coherence In Proceedings of Coling/ACL ‘98, Montreal, https://profs info uaic ro/~dcristea/papers/CristeaIdeRomary-Coling98 pdf Cristea, D , Ide, N , Marcu, D and Tablan, M -V (2000): Discourse Structure and Co- Reference: An Empirical Study, In Proceedings of The 18th International Conference on Computational Linguistics COLING'2000, Luxembourg, https://profs info uaic ro/~dcristea/papers/Cristea-etal-coling2000 pdf Ide, N and Cristea, D (2000): A Hierarchical Account of Referential Accessibility In Proceedings of The 38th Annual Meeting of the Association for Computational Linguistics, ACL'2000, Hong Kong, https://profs info uaic ro/~dcristea/papers/IdeCristea- ACL2000 pdf Cristea, D , Postolache, O , Pușcasu, G and Ghețu, L (2003) Summarizing documents based on cue-phrases and references, inProceedings of the International Symposium on Reference Resolution and Its Applications to Question Answering and Summarization, Venice, Italy, June https://profs info uaic ro/~dcristea/papers/cristea-etal- venezia2003 pdf Dan Cristea, Oana Postolache, Ionuț Cristian Pistol (2005) Summarisation through Discourse Structure In Alexander Gelbukh (ed ):Computational Linguistics and Intelligent Text Processing, 6th International Conference CICLing 2005, Mexico City, Mexico, February 13-19 2005, Proceedings, Springer LNSC, vol 3406, p 632-644 https://profs info uaic ro/~dcristea/papers/cristea-postolache-pistol-cicling05 pdf Cristea, D (2005): Motivations and Implications of Veins Theory, in B Sharp (Ed ) Natural Language Understanding and Cognitive Science, Proceedings of the 2nd International Workshop on Natural Language Understanding and Cognitive Scienc3, NLUCS 2005, in conjunction with ICEIS 2005, Miami, U S A , May 2005, INSTICC Press, https://profs info uaic ro/~dcristea/papers/cristea-nlucs2005 pdf